For Pitch Extraction
نویسندگان
چکیده
A synthesis-based method for pitch extraction of the speech signal is proposed. The method synthesizes a numb¢: of log power spectra for different values of fundamental frequency and compares them with the log power spectrum of the input speech segment. The average magnitude (AM) difference between the two spectra is used for comparison. The value of fundamental frequency that gives the minimum AM difference between the synthesized spectrum and the input spectrum is chosen as the estimated value of fundamental frequency. The voiced/unvoiced decision is made on the basis of the value of the AM difference at the minimum. For synthesizing the log power spectrum, the speech signal is assumed to be the output of an all-pole filter. The transfer function of the all-pole filter is estimated from the input speech segment by using the autocorrelation method of linear prediction. The synthesis-based method is tried out on real speech data and the results are discussed. R6sum6. Une m6thode d'extraction de ia fondamentale du signal de parole bas6e sur ia synth~se est proposb.e. Elle procb, de par la synth6se d'un certain nombre de spectres iogarithmiques de puissance pour diff6rentes valeurs de la fr6quence fondamentale, et pour ia comparaison de ces spectres avec le spectre logarithmique de puissance du segment de parole analys6. La diff6rence d'amplitude moyenne (AM) entre les deux spectres est utilis6e pour ia comparaison. La valeur de la fr6quence fondamentale qui fournit ie minimum de diff6rence AM entre le spectre synth6tis6 et ie spectre du signal d'entr6e est choisie comme estimation de ia hauteur. La d6cision vois6/non-voise est 6tablie sur la valeur de la diff6rence AM au minimum. Pour synth6tiser le spectre de puissance, le signal de parole est suppos6 repr6senter la sortie d'un filtre tout pble. La fonction de transfert du fiitre tout p61e est estim6e ~t partir du segment de parole par la m6thode d'autocorr61ation de ia pr6diction lin6aire. Cette m6thode bas6e sur la synth6se est test6e sur des segments de parole naturelle et les r6sultats sont discut6s. Introduction about speech segments and the pitch period (or its inverse, the fundamental frequency) is estimated Pitch extraction is an important problem in for the voiced speech segments. The pitch contours speech analysis, in pitch extraction of a speech of speech utterances are useful in various speech utterance, voiced/unvoiced decisions are made processing applications such as speech analysis-synthesis [!,2], speech understanding [31 and
منابع مشابه
Melody pitch estimation based on range estimation and candidate extraction using harmonic structure model
This paper proposes an algorithm to estimate the melody pitch line (the most dominant pitch sequence) of a given polyphonic audio based on melody range estimation and pitch candidate extraction using a harmonic structure model similar to that proposed by Goto. This paper defines melody pitch candidate as a list of pitch candidates that produces the best-fit harmonic models to the polyphonic aud...
متن کاملStatistical Characterisation of Melodic Pitch Contours and its Application for Melody Extraction
In this paper we present a method for the statistical characterisation of melodic pitch contours, and apply it to automatic melody extraction from polyphonic music signals. Within the context of melody extraction, pitch contours represent time and frequency continuous sequences of pitch candidates out of which the melody must be selected. In previous studies we presented a melody extraction alg...
متن کاملClassification of Iranian Traditional Music Dastgahs Using Features Based on Pitch Frequency
The Iranian traditional music is composed of seven majors Dastgahs: Chahargah, Homayoun, Mahour, Segah, Shour, Nava, and Rast-Panjgah. In this paper, a new algorithm for the classification of the Iranian traditional music Dastgahs based on pitch frequency is proposed. In this algorithm, the features of Lagrange coefficients of pitch logarithm (LCPL), Fuzzy similarity sets type 2 (FSST2), and th...
متن کاملProduction of Mesophase Pitch from Coal Tar and Petroleum Pitches using Supercritical Fluid Extraction
Supercritical fluid extraction (SFE) is currently being investigated as a possible technique in the production of high quality mesophase pitch from coal tar and petroleum pitches. Mesophase pitch is used to make high technology products, such as carbon fibre. The conventional production of mesophase pitch initially involves the removal of low molecular weight species from coal tar and petroleum...
متن کاملThe Pitch Extraction Method through Spectrum Flattening
The exact pitch(fundamental frequency) extraction is important in speech signal processing like speech recognition, speech analysis and synthesis. However the exact pitch extraction from speech signal is very difficult due to the effect of formant and transitional amplitude. So in this paper, the pitch is detected after the elimination of formant ingredients by flattening the spectrum in freque...
متن کاملWeighted autocorrelation for pitch extraction of noisy speech
In this paper, we propose a modified version of the autocorrelation pitch extraction method well known to be robust against noise. Utilizing that the average magnitude difference function (AMDF) has similar characteristics with the autocorrelation function, the autocorrelation function is weighted by the reciprocal of the AMDF. By simulation experiments, it is shown that the proposed pitch extr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1982